Module_based Analysis of Biological Data for Network Inference and Biomarker Discovery

نویسنده

  • Yuji Zhang
چکیده

Systems biology comprises the global, integrated analysis of large-scale data encoding different levels of biological information with the aim to obtain global insight into the cellular networks. Several studies have unveiled the modular and hierarchical organization inherent in these networks. In this dissertation, we propose and develop innovative systems approaches to integrate multi-source biological data in a modular manner for network inference and biomarker discovery in complex diseases such as breast cancer. The first part of the dissertation is focused on gene module identification in gene expression data. As the most popular way to identify gene modules, many cluster algorithms have been applied to the gene expression data analysis. For the purpose of evaluating clustering algorithms from a biological point of view, we propose a figure of merit based on Kullback-Leibler divergence between cluster membership and known gene ontology attributes. Several benchmark expression-based gene clustering algorithms are compared using the proposed method with different parameter settings. Applications to diverse public time course gene expression data demonstrated that fuzzy c-means clustering is superior to other clustering methods with regard to the enrichment of clusters for biological functions. These results contribute to the evaluation of clustering outcomes and the estimations of optimal clustering partitions. The second part of the dissertation presents a hybrid computational intelligence method to infer gene regulatory modules. We explore the combined advantages of the nonlinear and dynamic properties of neural networks, and the global search capabilities of the hybrid genetic algorithm and particle swarm optimization method to infer network interactions at modular level. The proposed computational framework is tested in two biological processes: yeast cell cycle, and human Hela cancer cell cycle. The identified gene regulatory modules were evaluated using several validation strategies: 1) gene set enrichment analysis to evaluate the gene modules derived from clustering results; (2) binding site enrichment analysis to determine enrichment of

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Adaptive neuro-fuzzy inference system (ANFIS) applied for spectrophotometric determination of fluoxetine and sertraline in pharmaceutical formulations and biological fluid

The UV-spectrophotometric method of analysis was proposed for simultaneous determination of fluoxetine (FLX) and sertraline (SRT). Considering the strong spectral overlap between UV-Vis spectra of these compounds, a previous separation should be carried out in order to determine them by conventional spectrophotometric techniques. Here, full-spectrum multivariate calibrations adaptive neuro-fuzz...

متن کامل

Proteomics Applications in Health: Biomarker and Drug Discovery and Food Industry

Advancing in genome sequencing has greatly propelled the understanding of the living world, however, it is insufficient for full description of a biological system. Focusing on, proteomics has emerged as another large-scale platform for improving the understanding of biology. Proteomic experiments can be used for different aspects of clinical and health sciences such as food technology, biomark...

متن کامل

Proteomics Applications in Health: Biomarker and Drug Discovery and Food Industry

Advancing in genome sequencing has greatly propelled the understanding of the living world, however, it is insufficient for full description of a biological system. Focusing on, proteomics has emerged as another large-scale platform for improving the understanding of biology. Proteomic experiments can be used for different aspects of clinical and health sciences such as food technology, biomark...

متن کامل

Introducing Potential Key Proteins and Pathways in Human Laryngeal Cancer: A System Biology Approach

The most common malignant neoplasm of the head and neck region is laryngeal cancerwhich presents a significant international health problem. The present study aims to screenpotential proteins related to laryngeal cancer by network analysis to further understandingdisease pathogenesis and biomarker discovery. Differentially expressed proteins were extractedfrom literatures of laryngeal cancer th...

متن کامل

Introducing Potential Key Proteins and Pathways in Human Laryngeal Cancer: A System Biology Approach

The most common malignant neoplasm of the head and neck region is laryngeal cancerwhich presents a significant international health problem. The present study aims to screenpotential proteins related to laryngeal cancer by network analysis to further understandingdisease pathogenesis and biomarker discovery. Differentially expressed proteins were extractedfrom literatures of laryngeal cancer th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013